Visualization of Categorical Longitudinal and Times Series Data.

Authors

  • Stephen J Tueller
  • Richard A Van Dorn
  • Georgiy V Bobashev
Abstract

Plotting growth curves is a powerful graphical approach used in exploratory data analysis for continuous longitudinal data. However, plotted growth curves for multiple participants rapidly become uninterpretable with categorical data. Categorical data define specific states (e.g., being single, married, divorced), and these states do not necessarily need to represent any hierarchical order. Thus, a trajectory becomes a sequence of states rather than a continuum. We introduce a horizontal line plot that uses shade or color to differentiate between states on a categorical longitudinal variable for multiple participants. With appropriate sorting, stacking the horizontal lines that represent each participant can reveal important patterns such as the shape of, or heterogeneity in, the trajectories. We illustrate the plotting techniques for large sample sizes, observed groups, the exploration of unobserved latent classes, large numbers of time points such as are found with intensive longitudinal designs or multivariate time series data, individually varying times of observation, unique numbers of observations, and missing data. We used the R package longCatEDA to create the illustrations. Illustrative data include both simulated data and alcohol consumption data in adult schizophrenics from the Clinical Antipsychotic Trials of Intervention Effectiveness.
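The stacking idea described in the abstract can be illustrated with a minimal text-only sketch in Python. It is not the longCatEDA implementation: the participant data, the state symbols, the function names, and the simple lexicographic sort (with missing observations ordered last) are all assumptions chosen for illustration. Each participant becomes one horizontal line, each time point one character, and sorting places similar trajectories next to each other.

```python
from typing import Dict, List, Optional, Sequence, Tuple

State = Optional[str]  # None marks a missing observation


def sort_sequences(seqs: Dict[str, Sequence[State]]) -> List[Tuple[str, Sequence[State]]]:
    """Order participants lexicographically by their state sequence.

    This sort rule is an assumption for illustration, not the package's rule.
    Missing values (None) sort after every observed state at that time point.
    """
    def key(item: Tuple[str, Sequence[State]]):
        _, seq = item
        return tuple((s is None, s or "") for s in seq)

    return sorted(seqs.items(), key=key)


def render(seqs: Dict[str, Sequence[State]], symbols: Dict[str, str], missing: str = ".") -> str:
    """Draw one horizontal line per participant, one character per time point."""
    rows = []
    for pid, seq in sort_sequences(seqs):
        line = "".join(missing if s is None else symbols[s] for s in seq)
        rows.append(f"{pid:>4} {line}")
    return "\n".join(rows)


# Hypothetical data: four participants observed at four time points.
data = {
    "p1": ["single", "single", "married", "married"],
    "p2": ["single", "married", "married", "divorced"],
    "p3": ["single", "single", "single", None],
    "p4": ["single", "married", "married", "married"],
}
# Hypothetical one-character symbols standing in for shades or colors.
symbols = {"single": "S", "married": "M", "divorced": "D"}

print(render(data, symbols))
```

In this sketch the participants who marry early end up adjacent in the stacked output, which is the kind of pattern the sorted horizontal line plot is meant to expose. A graphical version would replace each character with a colored or shaded line segment.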

Similar resources

Analysis of Dynamic Longitudinal Categorical Data in Incomplete Contingency Tables Using Capture-Recapture Sampling: A Case Study of Semi-Concentrated Doctoral Exam

Abstract. In this paper, dynamic longitudinal categorical data and estimation of their parameters in incomplete contingency tables are evaluated. To apply the proposed method, a study has been conducted on the data of the semi-concentrated doctoral exam of the National Organization for Educational Testing (NOET). The results of studies such as the obtained confidence intervals and calculating t...

A New Method for Visualizing and Mining Repeated Measure Data (SEER): Relationship between Education and Career Paths

Data mining and visualization received much attention in recent decades due to technological advances. Analysts, engineers, researchers, scientists, and policy makers often encounter massive amounts of data. This paper introduces a visualization technique, SEER, developed for policy makers and researchers to graphically analyze and explore massive amounts of categorical data collected in longit...

Finding Comparable Patient Histories: A Temporal Categorical Similarity Measure with an Interactive Visualization

Finding similar patients within millions of Electronic Health Records (EHRs) is a challenging problem. A major challenge is how to define a similarity measure that captures the searcher's intent. Many methods for computing a similarity measure between time series have been proposed, but patient histories with temporal categorical data require fresh thinking. To address this problem, we propose a te...

Socioeconomic Determinants of Infant Mortality in Iranian Children: A Longitudinal Econometrics Analysis

Methods: Using national-level time series data (1967 to 2012), we explored the association of total fertility rate, GDP per capita, number of physicians per 1,000 population, female labor force participation rate, percentage of people living in rural regions, and mean years of schooling per person with the infant mortality rate of Iran. These data were obtained from Central Bank of Isla...

Categorical Data Visualization and Clustering Using Subjective Factors

A common issue in cluster analysis is that there is no single correct answer to the number of clusters, since cluster analysis involves subjective human judgement. Interactive visualization is one of the methods by which users can decide on proper clustering parameters. In this paper, a new clustering approach called CDCS (Categorical Data Clustering with Subjective factors) is introduced, where a ...


Journal title:
  • Methods report

Volume 2016  Issue 

Pages  -

Publication date 2016